Investigating Entropy for Extractive Document Summarization
نویسندگان
چکیده
Automatic text summarization aims to cut down readers’ time and cognitive effort by reducing the content of a document without compromising on its essence. Ergo, informativeness is prime attribute summary generated an algorithm, selecting sentences that capture essence primary goal extractive summarization. In this paper, we employ Shannon’s entropy sentences. We Non-negative Matrix Factorization (NMF) reveal probability distributions for computing terms, topics, in latent space. present information theoretic interpretation computed entropy, which bedrock proposed E-Summ unsupervised method The algorithm systematically applies principle informative from important topics document. generic fast, hence amenable use documents real time. Furthermore, it domain-, collection-independent agnostic language Benefiting strictly positive NMF factor matrices, transparent explainable too. standard ROUGE toolkit performance evaluation four well known public data-sets. also perform quantitative assessment quality semantic similarity w.r.t original Our investigation reveals though using approach promises efficient, explainable, independent summarization, needs be bolstered match deep neural methods.
منابع مشابه
Extractive Document Summarization
We present two novel and contrasting Recurrent Neural Network (RNN) based architectures for extractive summarization of documents. The Classifier based architecture sequentially accepts or rejects each sentence in the original document order for its membership in the final summary. The Selector architecture, on the other hand, is free to pick one sentence at a time in any arbitrary order to pie...
متن کاملSemi-extractive Multi-document Summarization
In this thesis, I design a Maximum Coverage problem with KnaPsack constraint (MCKP) based model for extractive multi-document summarization. The model integrates three measures to detect important sentences including Coverage, rewards sentences in regards to their representative level of the whole document, Relevance, focuses to select sentences that related to the given query, and Compression,...
متن کاملExtractive spoken document summarization for information retrieval
The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In the paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling...
متن کاملExtractive Multi-document Summarization Using Multilayer Networks
Huge volumes of textual information has been produced every single day. In order to organize and understand such large datasets, in recent years, summarization techniques have become popular. These techniques aims at finding relevant, concise and non-redundant content from such a big data. While network methods have been adopted to model texts in some scenarios, a systematic evaluation of multi...
متن کاملClassify or Select: Neural Architectures for Extractive Document Summarization
We present two novel and contrasting Recurrent Neural Network (RNN) based architectures for extractive summarization of documents. The Classifier based architecture sequentially accepts or rejects each sentence in the original document order for its membership in the final summary. The Selector architecture, on the other hand, is free to pick one sentence at a time in any arbitrary order to pie...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Expert Systems With Applications
سال: 2022
ISSN: ['1873-6793', '0957-4174']
DOI: https://doi.org/10.1016/j.eswa.2021.115820